RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /o/op/ ; 

Source: 

Date Processed by STIC: 3/2GpD^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21heIp@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3heIp@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://vrmv.uspto,gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documen ts.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 191 1 South Clark Street, Arlington, VA 22202 
Or 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 

Revised 01/29/2002 




Raw Sequence Listing Error Summary 



/o/ool , Wi 



ERROR! DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: WEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA** HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 ^ Wrapped Nucleics The number/text at the end of each line "wrapped" down to the next line. This may occur if your file 
Wrapped Aminos was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping." 

2 I nvalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 t/^ Misaligned Amino 
Numbering 

4 Non- ASCII 



^Variable Length 



6 Patentln 2.0 

"bug" 



8 Skipped Sequences 

(NEW RULES) 



Use of n's or Xaa*s 



10 



11 



12 



(NEW RULES) 



Jnvalid <213> 
Response 



Useof<220> 



Patentln 2.0 
"bug" 



The numbering under each 5 th amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 

Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



_Skipped Sequences Sequences) 
(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X'Ms shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence tfules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence ■••■.«■ 



Sequence(s) _ 



missing the <220> "Feature" and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 

any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
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OIPE 



RAW SEQUENCE LISTING DATE: 03/25/2002 

PATENT APPLICATION: US/10/001,486 TIME: 14:48:40 

Input Set : A:\W065457.txt 

Output Set: N:\CRF3\0 3252002\J0014 86.raw ^ DOSS Not Gompfy 

sequence listing . ^orrecfccf Diskette Needed 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: The Government of the United States of America 

7 AjJbJbL<*£ as represented by the Secretary 

8 Department of Health and Human Services 

9 Washington, D.C. 

10 Htun Ph.D. , Han 

11 Hager Ph.D., Gordon L. 

14 (ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR MONITORING 

15 DNA BINDING MOLECULES IN LIVING CELLS 
17 (iii) NUMBER OF SEQUENCES: 9 

19 (iv) CORRESPONDENCE ADDRESS: 

2 0 (A) ADDRESSEE: Needle & Rosenberg 

21 (B) STREET: 127 Peachtree Street, Suite 1200 

22 (C) CITY: Atlanta 

23 (D) STATE: Georgia 

24 (E) COUNTRY: USA 

25 (F) ZIP: 30303 

27 (V) COMPUTER READABLE FORM: 

2 8 (A) MEDIUM TYPE: Floppy disk 

2 9 (B) COMPUTER: IBM PC compatible 

30 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

31 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

3 3 (vi) CURRENT APPLICATION DATA: 

34 (A) APPLICATION NUMBER: US/10/001,4 86 

35 (B) FILING DATE: ll-Mar-2002 

41 (C) CLASSIFICATION: 

3 8 (vii) PRIOR APPLICATION DATA: 

39 (A) APPLICATION NUMBER: 60/008,373 

40 (B) FILING DATE: 08 Dec 1995 

4 3 (viii) ATTORNEY/AGENT INFORMATION: 

44 (A) NAME: Selby, Elizabeth 

45 (B) REGISTRATION NUMBER: 38298 

46 (C) REFERENCE/DOCKET NUMBER: 14014.0183 
4 8 (ix) TELECOMMUNICATION INFORMATION: 

49 (A) TELEPHONE: 404-688-0770 

50 (B) TELEFAX: 404-688-9880 



ERRORED SEQUENCES 

53 (2) INFORMATION FOR SEQ ID NO: 1: 
55 (i) SEQUENCE CHARACTERISTICS: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/001 , 4 86 



DATE: 03/25/2002 
TIME: 14:48:40 




E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E 
E- 
E - 
E- 



E- 
W- 



Input Set : A:\W065457.txt 

Output Set: N:\CRF3\03252002\J001486.raw 

(A) LENGTH: 7257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
MOLECULE TYPE: nucleic acid 
FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1072.. 4284 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CCA TATTATTCAT TGGTTATATA GCATAAATCA 



TTGGCCATTj 

74 ( jgCTCATG 



TT^^A^ACG^TG 

Tec _ ,1 20 - 



TGT ATCTATATCA TAATATGTAC ATTTATATTG 




--> 



--> 



--> 



--> 





76 AATATGACCG CCATGTTGGC ATTGATTATT GACTAGTTAT TAATAGTAAT 

77 CAATTACGGG 180 

79 GTCATTAGTT CAT AG C C CAT ATATGGAGTT CCGCGTTACA TAACTTACGG 

80 TAAATGGCCC 240 

82 GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT 

83 ATGTTCCCAT 3 00 

85 AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC 

8 6 GGTAAACTGC 3 60 

88 CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTCCG CCCCCTATTG 

89 ACGTCAATGA 420 

91 CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTACGGGACT 

92 TTCCTACTTG 480 

94 GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 

95 GGCAGTACAC 540 

97 CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 

98 CCATTGACGT 600 

100 CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 

101 GTAATAACCC 6 60 

103 CGCCCCGTTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 

104 TAAGCAGAGC 720 

106 TCGTTTAGTG AACCGTCAGA TCACTAGAAG CTTTATTGCG GTAGTTTATC 

107 ACAGTTAAAT 780 

109 TGCTAACGCA GTCAGTGCTT CTGACACAAC AGTCTCGAAC TTAAGCTGCA 

110 GAAGTTGGTC 84 0 

112 GTGAGGCACT GGGCAGGTAA GTATCAAGGT TACAAGACAG GTTTAAGGAG 

113 ACCAATAGAA 900 
-> 115 ACTGGGCTTG TCGAGACAGA GAAGACTCTT GCGTTTCTGA TAGGCACCTA 

116 TTGGTCTTAC 960 
-> 118 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGTTCAATT 

119 ACAGCTCTTA 1020 
-> 121 AGGCTAGAGT ACTTAATACG ACTCACTATA GGCTAGCGAA GGAGATCCGC C ATG 

122 GCC 1077 

123 

124 

-> 126 CAC CAT CAC CAC CAT CAC GGA TAT CCA TAC GAC GTG CCA GAT TAC GCT 
-> 127 1125 



(f}\ 




Met Ala 
1 
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RAW SEQUENCE LISTING DATE: 03/25/2002 

PATENT APPLICATION: US/10/001 f 4 86 TIME: 14:48:40 

Input Set : A:\W065457.txt 

Output Set: N:\CRF3\03252002\J0014 86.raw 



128 His His His His His His Gly Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
W--> 129 5 10 15 

E--> 131 CAG TCG AGT GCC ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC 
W--> 132 1173 

133 Gin Ser Ser Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 
W--> 134 20 25 30 

E--> 137 CCA ATT CTT GTT GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT 
W--> 138 1221 

139 Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
W--> 140 35 40 45 50 

E--> 142 GTC AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT 
W--> 143 1269 

144 Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
W--> 145 55 60 65 

E--> 147 AAA TTT ATT TGC ACT ACT GGA AAA CTA CCT GTT CCT TGG CCA ACA CTT 
W--> 148 1317 

149 Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
W--> 150 70 75 80 

E--> 152 GTC ACT ACT TTC ACT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT 
W--> 153 1365 

154 Val Thr Thr Phe Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
W--> 155 85 90 95 

E--> 157 CAT ATG AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT 
W--> 158 1413 

159 His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
W--> 160 100 105 110 

E--> 162 GTA CAG GAA AGA ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA 
W--> 163 1461 

164 Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
W--> 165 115 120 125 130 

E--> 167 CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG 
W--> 168 1509 

169 Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu 
W--> 170 135 140 145 

E--> 172 TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA 
W--> 173 1557 

174 Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys 
W--> 175 150 155 160 

E--> 177 TTG GAA TAC AAC TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA 
W--> 178 1605 

179 Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys 
W--> 180 165 170 175 

E--> 182 CAA AAG AAT GGA ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA 
W--> 183 1653 

184 Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu 
W--> 185 180 185 190 

E--> 187 GAT GGA AGC GTT CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT 
W--> 188 1701 

189 Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/001, 486 



DATE: 03/25/2002 
TIME: 14:48:40 



Input Set : A:\W065457.txt 

Output Set: N:\CRF3\03252002\J001486.raw 

W--> 190 195 200 205 210 

E--> 192 GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA 
W--> 193 1749 

194 Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
W--> 195 215 220 225 

E--> 197 TCT GCC CTT TCG AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT 
W--> 198 1797 

199 Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
W— > 200 230 235 240 

E--> 202 CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA 
W--> 203 1845 

204 Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu 
W--> 205 245 250 255 

E--> 209 TAC AAA GGC GCC GGC GCT GGT GCT GGT GCT GGC GCC ATC AGC GCG CTG 
W— > 210 1893 

211 Tyr Lys Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala lie Ser Ala Leu 
W--> 212 260 265 270 

E--> 214 ATC CTG GAC TCC AAA GAA TCC TTA GCT CCC CCT GGT AGA GAC GAA GTC 
W--> 215 1941 

216 lie Leu Asp Ser Lys Glu Ser Leu Ala Pro Pro Gly Arg Asp Glu Val 
W--> 217 275 280 285 290 

E--> 219 CCT GGC AGT TTG CTT GGC CAG GGG AGG GGG AGC GTA ATG GAC TTT TAT 
W--> 220 1989 

221 Pro Gly Ser Leu Leu Gly Gin Gly Arg Gly Ser Val Met Asp Phe Tyr 
W--> 222 295 300 305 

E--> 224 AAA AGC CTG AGG GGA GGA GCT ACA GTC AAG GTT TCT GCA TCT TCG CCC 
W--> 225 2037 

226 Lys Ser Leu Arg Gly Gly Ala Thr Val Lys Val Ser Ala Ser Ser Pro 
W--> 227 310 315 320 

E--> 229 TCA GTG GCT GCT GCT TCT CAG GCA GAT TCC AAG CAG CAG AGG ATT CTC 
W--> 230 2085 

231 Ser Val Ala Ala Ala Ser Gin Ala Asp Ser Lys Gin Gin Arg lie Leu 
W--> 232 325 330 335 

E--> 234 CTT GAT TTC TCG AAA GGC TCC ACA AGC AAT GTG CAG CAG CGA CAG CAG 
W--> 235 2133 

236 Leu Asp Phe Ser Lys Gly Ser Thr Ser Asn Val Gin Gin Arg Gin Gin 
W--> 237 340 345 350 

E--> 239 CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG 
W--> 240 2181 

241 Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
W--> 242 355 360 365 370 

E--> 244 CAG CCA GGC TTA TCC AAA GCC GTT TCA CTG TCC ATG GGG CTG TAT ATG 
W--> 245 2229 

24 6 Gin Pro Gly Leu Ser Lys Ala Val Ser Leu Ser Met Gly Leu Tyr Met 
W--> 247 375 380 385 

E--> 24 9 GGA GAG ACA GAA ACA AAA GTG ATG GGG AAT GAC TTG GGC TAC CCA CAG 
W--> 250 2277 

251 Gly Glu Thr Glu Thr Lys Val Met Gly Asn Asp Leu Gly Tyr Pro Gin 
W--> 252 390 395 400 
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RAW SEQUENCE LISTING DATE: 03/25/2002 

PATENT APPLICATION: US/10/001, 486 TIME: 14:48:40 

Input Set : A:\W065457.txt 

Output Set: N:\CRF3\03252002\J0014 86.raw 

E--> 2 54 CAG GGC CAA CTT GGC CTT TCC TCT GGG GAA ACA GAC TTT CGG CTT CTG 
W--> 255 2325 

256 Gin Gly Gin Leu Gly Leu Ser Ser Gly Glu Thr Asp Phe Arg Leu Leu 
W--> 257 405 410 415 

E--> 259 GAA GAA AGC ATT GCA AAC CTC AAT AGG TCG ACC AGC GTT CCA GAG AAC 
W--> 260 2373 

261 Glu Glu Ser lie Ala Asn Leu Asn Arg Ser Thr Ser Val Pro Glu Asn 
W--> 262 420 425 430 

E--> 264 CCC AAG AGT TCA ACG TCT GCA ACT GGG TGT GCT ACC CCG ACA GAG AAG 
W--> 265 2421 

266 Pro Lys Ser Ser Thr Ser Ala Thr Gly Cys Ala Thr Pro Thr Glu Lys 
W--> 267 435 440 445 450 

E--> 269 GAG TTT CCC AAA ACT CAC TCG GAT GCA TCT TCA GAA CAG CAA AAT CGA 
W--> 270 2469 

271 Glu Phe Pro Lys Thr His Ser Asp Ala Ser Ser Glu Gin Gin Asn Arg 
W--> 272 455 460 465 

E--> 274 AAA AGC CAG ACC GGC ACC AAC GGA GGC AGT GTG AAA TTG TAT CCC ACA 
W--> 275 2517 

27 6 Lys Ser Gin Thr Gly Thr Asn Gly Gly Ser Val Lys Leu Tyr Pro Thr 
W--> 277 470 475 480 

E--> 281 GAC CAA AGC ACC TTT GAC CTC TTG AAG GAT TTG GAG TTT TCC GCT GGG 
W--> 282 2565 

283 Asp Gin Ser Thr Phe Asp Leu Leu Lys Asp Leu Glu Phe Ser Ala Gly 
W--> 284 485 490 495 

E--> 2 86 TCC CCA AGT AAA GAC ACA AAC GAG AGT CCC TGG AGA TCA GAT CTG TTG 
W--> 287 2613 

288 Ser Pro Ser Lys Asp Thr Asn Glu Ser Pro Trp Arg Ser Asp Leu Leu 
W--> 289 500 505 510 

E--> 291 ATA GAT GAA AAC TTG CTT TCT CCT TTG GCG GGA GAA GAT GAT CCA TTC 
W--> 292 2661 

293 lie Asp Glu Asn Leu Leu Ser Pro Leu Ala Gly Glu Asp Asp Pro Phe 
W--> 294 515 520 525 530 

E--> 296 CTT CTC GAA GGG AAC ACG AAT GAG GAT TGT AAG CCT CTT ATT TTA CCG 
W--> 297 2709 

298 Leu Leu Glu Gly Asn Thr Asn Glu Asp Cys Lys Pro Leu lie Leu Pro 
W--> 299 535 540 545 

E--> 3 01 GAC ACT AAA CCT AAA ATT AAG GAT ACT GGA GAT ACA ATC TTA TCA AGT 
W--> 302 2757 

303 Asp Thr Lys Pro Lys lie Lys Asp Thr Gly Asp Thr lie Leu Ser Ser 
W--> 304 550 555 560 

E--> 306 CCC AGC AGT GTG GCA CTA CCC CAA GTG AAA ACA GAA AAA GAT GAT TTC 
W--> 307 2805 

30 8 Pro Ser Ser Val Ala Leu Pro Gin Val Lys Thr Glu Lys Asp Asp Phe 
W--> 309 565 570 575 

E--> 311 ATT GAA CTT TGC ACC CCC GGG GTA ATT AAG CAA GAG AAA CTG GGC CCA 
W--> 312 2853 

313 lie Glu Leu Cys Thr Pro Gly Val lie Lys Gin Glu Lys Leu Gly Pro 
W--> 314 580 585 590 

E--> 316 GTT TAT TGT CAG GCA AGC TTT TCT GGG ACA AAT ATA ATT GGT AAT AAA 
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RAW SEQUENCE LISTING DATE: 03/25/2002 

PATENT APPLICATION: US/10/001, 486 TIME: 14:48:40 



Input Set : A:\W0654 57.txt 

Output Set: N:\CRF3\03252002\J001486.raw 
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349 








VJ _7 J 










100 










7 0^ 




E- 


- > 


1 SI 

-j j 0 


GCC ACG 


GGA 


PPA 


PPT 

v*.v«.X 


PPP 


A AP 


PTP 


TPP 


PTP 


PTP 
\s Xw 


TPP 


TPP 

ILL. 


PAT 


PAA 


PPT 


W- 


- > 


1 S4 


3237 


































1 ss 


Ala Thr 


Gly 




r^X (_> 


Pro 


Lys 


Leu 


L^y t> 


Leu 


Val 




Car 
ocl 


Asp 


PI n 
Vjj 1 u 


A 1 3 
Hid. 


W- 


- > 


356 






710 

/IV 










71 "5 










790 








_ > 


1 Sft 

0 ~j 0 


TCA GGA 


TGT 


PAT 


TAP 


ccc 


vjXvj 


PTP 


AP A 


TPT 
ibl 


PPA 


APP 


TPP 


AAA 


PTA 
«J lh 


TTP 
lit 


TV7- 
rV 


- > 


0 c q 


3285 


































.3 0 u 


Ser Gly 


Cys 


rixS 


Tyr 


P T \r 


val 


Leu 


Thr 


Cys 


<aiy 


Ser 


Cys 


Lys 


v ax 


riie 


w- 


-> 


361 




725 










730 










735 








E- 


-> 


363 


TTT AAA 


AGA 


GCA 


GTG 


GAA 


GGA 


CAG 


CAC 


AAT 


TAC 


CTT 


TGT 


GCT 


GGA 


AGA 


W- 


-> 


364 


3333 


































365 


Phe Lys 


Arg 


Ala 


Val 


Glu 


Gly 


Gin 


His 


Asn 


Tyr 


Leu 


Cys 


Ala 


Gly 


Arg 


w- 


-> 


366 


740 










745 










750 










E- 


-> 


368 


AAC GAT 


TGC 


ATC 


ATT 


GAT 


AAA 


ATT 


CGA 


AGG 


AAA 


AAC 


TGC 


CCA 


GCA 


TGC 


W- 


-> 


369 


3381 


































370 


Asn Asp 


Cys 


He 


He 


Asp 


Lys 


He 


Arg 


Arg 


Lys 


Asn 


Cys 


Pro 


Ala 


Cys 


W- 


~> 


371 


755 








760 










765 










770 


E- 


-> 


373 


CGC TAT 


CGG 


AAA 


TGT 


CTT 


CAG 


GCT 


GGA 


ATG 


AAC 


CTT 


GAA 


GCT 


CGA 


AAA 


W- 


-> 


374 


3429 


































375 


Arg Tyr 


Arg 


Lys 


Cys 


Leu 


Gin 


Ala 


Gly 


Met 


Asn 


Leu 


Glu 


Ala 


Arg 


Lys 


w- 


-> 


376 








775 










780 










785 




E- 


-> 


378 


ACA AAG 


AAA 


AAA 


ATC 


AAA 


GGG 


ATT 


CAG 


CAA 


GCC 


ACT 


GCA 


GGA 


GTC 


TCA 


W- 


-> 


379 


3477 
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Output Set: N:\CRF3\03252002\J001486.raw 

380 Thr Lys Lys Lys lie Lys Gly He Gin Gin Ala Thr Ala Gly Val Ser 
W--> 381 790 795 800 

E--> 3 83 CAA GAC ACT TCG GAA AAT CCT AAC AAA ACA ATA GTT CCT GCA GCA TTA 
W--> 384 3525 

385 Gin Asp Thr Ser Glu Asn Pro Asn Lys Thr He Val Pro Ala Ala Leu 
W--> 386 805 810 815 

E--> 3 88 CCA CAG CTC ACC CCT ACC TTG GTG TCA CTG CTG GAG GTG ATT GAA CCC 
W--> 389 3573 

390 Pro Gin Leu Thr Pro Thr Leu Val Ser Leu Leu Glu Val He Glu Pro 
W--> 391 820 825 830 

E--> 3 93 GAG GTG TTG TAT GCA GGA TAT GAT AGC TCT GTT CCA GAT TCA GCA TGG 
W--> 394 3621 

3 95 Glu Val Leu Tyr Ala Gly Tyr Asp Ser Ser Val Pro Asp Ser Ala Trp 
W--> 396 835 840 845 850 

E--> 398 AGA ATT ATG ACC ACA CTC AAC ATG TTA GGT GGG CGT CAA GTG ATT GCA 
W--> 399 3669 

400 Arg He Met Thr Thr Leu Asn Met Leu Gly Gly Arg Gin Val He Ala 



w- 


-> 


401 








855 










860 










865 




E- 


-> 


403 


GCA GTG 


AAA 


TGG 


GCA 


AAG 


GCG 


ATA 


CTA 


GGC 


TTG 


AGA 


AAC 


TTA 


CAC 


CTC 


W- 


-> 


404 


3717 


































405 


Ala Val 


Lys 


Trp Ala 


Lys 


Ala 


He 


Leu Gly 


Leu 


Arg 


Asn 


Leu 


His 


Leu 


w- 


-> 


406 






870 










875 










880 






E- 


-> 


408 


GAT GAC 


CAA 


ATG 


ACC 


CTG 


CTA 


CAG 


TAC 


TCA 


TGG 


ATG 


TTT 


CTC 


ATG 


GCA 


W- 


-> 


409 


3765 


































410 


Asp Asp 


Gin 


Met 


Thr 


Leu 


Leu 


Gin 


Tyr 


Ser 


Trp 


Met 


Phe 


Leu 


Met 


Ala 


w- 


-> 


411 




885 










890 










895 








E- 


-> 


413 


TTT GCC 


TTG 


GGT 


TGG 


AGA 


TCA 


TAC 


AGA 


CAA 


TCA 


AGC 


GGA 


AAC 


CTG 


CTC 


W- 


-> 


414 


3813 


































415 


Phe Ala 


Leu 


Gly 


Trp 


Arg 


Ser 


Tyr 


Arg 


Gin 


Ser 


Ser 


Gly 


Asn 


Leu 


Leu 


w- 


-> 


416 


900 










905 










910 










E- 


-> 


418 


TGC TTT 


GCT 


CCT 


GAT 


CTG 


ATT 


ATT 


AAT 


GAG 


CAG 


AGA 


ATG 


TCT 


CTA 


CCC 


w- 


-> 


419 


3861 


































420 


Cys Phe 


Ala 


Pro Asp 


Leu 


He 


He 


Asn 


Glu 


Gin 


Arg 


Met 


Ser 


Leu 


Pro 


W- 


-> 


421 


915 








920 










925 










930 


E- 


-> 


425 


GGC ATG 


TAT 


GAC 


CAA 


TGT 


AAA 


CAC 


ATG 


CTG 


TTT 


GTC 


TCC 


TCT 


GAA 


TTA 


W- 


-> 


426 


3909 


































427 


Gly Met 


Tyr 


Asp 


Gin 


Cys 


Lys 


His 


Met 


Leu 


Phe 


Val 


Ser 


Ser 


Glu 


Leu 


W- 


-> 


428 








935 










940 










945 




E- 


-> 


430 


CAA AGA 


TTG 


CAG 


GTA 


TCC 


TAT 


GAA 


GAG 


TAT 


CTC 


TGT 


ATG 


AAA 


ACC 


TTA 


w- 


-> 


431 


3957 


































432 


Gin Arg 


Leu 


Gin 


Val 


Ser 


Tyr 


Glu 


Glu 


Tyr 


Leu 


Cys 


Met 


Lys 


Thr 


Leu 


w- 


-> 


433 






950 










955 










960 






E - 


-> 


435 


CTG CTT 


CTC 


TCC 


TCA 


GTT 


CCT 


AAG 


GAA 


GGT 


CTG 


AAG 


AGC 


CAA 


GAG 


TTA 


W- 


-> 


436 


4005 


































437 


Leu Leu 


Leu 


Ser 


Ser 


Val 


Pro 


Lys 


Glu 


Gly 


Leu 


Lys 


Ser 


Gin 


Glu 


Leu 


W- 


-> 


438 




965 










970 










975 








E- 


-> 


440 


TTT GAT 


GAG 


ATT 


CGA 


ATG 


ACT 


TAT 


ATC 


AAA 


GAG 


CTA 


GGA 


AAA 


GCC 


ATC 


W- 


-> 


441 


4053 


































442 


Phe Asp 


Glu 


He Arg 


Met 


Thr 


Tyr 


He 




Glu 


Leu 


Gly 


Lys 


Ala 


He 
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E--> 513 GCGGTATTTT CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT 

514 GCACTCTCAG 5194 

E--> 516 TACAATCTGC TCTGATGCCG CAT AG TT AAG CCAGCCCCGA CACCCGCCAA 

517 CACCCGCTGA 52 54 

E--> 519 CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG 

520 TGACCGTCTC 5314 

E--> 522 CGGGAGCTGC ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA 

523 GACGAAAGGG 53 74 

E--> 525 CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 

526 CTTAGACGTC 54 34 

E--> 528 AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 

529 TCTAAATACA 54 94 

E--> 531 TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 

532 AATATTGAAA 5554 



E--> 534 AAG G AAG AG T ATG AG TATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 

5 35 TTGCGGCATT 5614 

E--> 537 TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 

5 38 CTGAAGATCA 5 674 

E--> 540 GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 

541 TCCTTGAGAG 57 34 




E--> 543 TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC 

544 TATGTGGCGC 5794 

E--> 546 GG TATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 

547 ACTATTCTCA 5854 

E--> 549 GAATGACTTG GTTGAGTACT CACCAGTCAC AG AAAAG CAT CTTACGGATG 

550 GCATGACAGT 5914 

E--> 5 52 AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 

5 53 ACTTACTTCT 5974 

E--> 555 GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 

556 GGGATCATGT 60 34 

E--> 5 58 AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 

559 ACGAGCGTGA 6 0 94 

E--> 561 CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 

562 GCGAACTACT 6154 

E--> 564 TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 

565 TTGCAGGACC 6214 

E--> 567 ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 

568 GAGCCGGTGA 62 74 

E--> 570 GCGTGGGTCT CGCGGTATCA T TG C AG C AC T GGGGCCAGAT GGTAAGCCCT 

571 CCCGTATCGT 6334 

E--> 573 AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 

574 AGATCGCTGA 63 94 

E--> 576 GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 

577 CATATATACT 64 54 

E--> 579 TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 

580 TCCTTTTTGA 6514 

E--> 582 TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 

583 CAGACCCCGT 6574 

E--> 585 AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 
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W--> 443 980 985 990 

E--> 446 GTC AAA AGG GAA GGG AAC TCC AGT CAG AAC TGG CAA CGG TTT TAC CAA 
W--> 447 4101 

448 Val Lys Arg Glu Gly Asn Ser Ser Gin Asn Trp Gin Arg Phe Tyr Gin 
W--> 449 995 1000 1005 1010 

E--> 451 CTG ACA AAG CTT CTG 
W--> 452 4149 

453 Leu Thr Lys Leu Leu 
W--> 454 101 
E--> 456 ACC TAC TGC TTC CAG 
W--> 457 4197 

458 Thr Tyr Cys Phe Gin 
W--> 459 1030 
E--> 461 CCA GAG ATG TTA GCT 
W--> 462 4245 

463 Pro Glu Met Leu Ala 
W--> 464 1045 
E--> 466 AAT GGA AAT ATC AAA 
W--> 467 4294 

468 Asn Gly Asn lie Lys 
W--> 469 1060 

E--> 471 AAGAAAGGTT GCCTTAAAGA AAGTTGAATT TATAG T C TAG AGTCGACCCG 

472 GGCGGCCGCT 4354 
E--> 474 TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA 

475 GAATGCAGTG 4414 
E--> 477 AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA 

478 CCATTATAAG 4474 
E--> 4 80 CTGCAATAAA C AAG T T AAC A ACAACAATTG CATTCATTTT ATGTTTCAGG 

481 TTCAGGGGGA 4 534 
E--> 483 GATGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA 

484 TCGATAAGGA 4 594 
E--> 486 TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC 

487 AGTTGCGCAG 4654 
E--> 489 CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG 

490 TGTGGTGGTT 4714 
E--> 492 ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT 

4 93 CGCTTTCTTC 4 7 74 

E--> 495 CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG 

4 96 GGGGCTCCCT 4 834 

E--> 4 98 TTAGGGTTCC GATTTAGAGC TTTACGGCAC CTCGACCGCA AAAAACTTGA 

499 TTTGGGTGAT 4894 
E~-> 501 GGTTCACGTA GTGGGCCATC GCCCTGATAG ACGGTTTTTC GCCCTTTGAC 

502 GTTGGAGTCC 4954 
E--> 504 ACGTTCTTTA ATAGTGGACT CTTGTTCCAA AC TG G AAC AA CACTCAACCC 

505 TATCTCGGTC 5014 
E--> 507 TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT ATTGGTTAAA 

508 AAATGAGCTG 5074 
E--> 510 ATTTAACAAA TATTTAACGC GAATTTTAAC AAAATATTAA CGTTTACAAT 

511 TTCGCCTGAT 5134 



GAC TCC ATG CAT GAG GTG GTT GAG AAT CTC CTT 

Asp Ser Met His Glu Val Val Glu Asn Leu Leu 
5 1020 1025 

ACA TTT TTG GAT AAG ACC ATG AGT ATT GAA TTC 

Thr Phe Leu Asp Lys Thr Met Ser lie Glu Phe 

1035 1040 
GAA ATC ATC ACT AAT CAG ATA CCA AAA TAT TCA 

Glu lie He Thr Asn Gin He Pro Lys Tyr Ser 

1050 1055 
AAG CTT CTG TTT CAT CAA AAA TGA CTGCCTTACT 

Lys Leu Leu Phe His Gin Lys * 
1065 1070 
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5 86 GCTGCTTGCA 6 6 34 

E--> 5 88 AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 

589 TACCAACTCT 66 94 

E--> 591 TTTTCCGAAG GTAACTGGCT TC AG CAGAGC G C AG AT AC C A AATACTGTCC 

592 TTCTAGTGTA 6754 

E--> 594 GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 

5 95 TCGCTCTGCT 6 814 
E--> 597 AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 

598 GGTTGGACTC 6874 

E--> 600 AAG AC GAT AG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 

6 01 CGTGCACACA 6934 
E--> 603 GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC C T AC AG CG TG 

6 04 AGCTATGAGA 6994 

E--> 606 AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 

607 GCAGGGTCGG 70 54 

E--> 609 AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 

610 ATAGTCCTGT 7114 

E--> 612 CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 

613 GGGGGCGGAG 7174 

E--> 615 CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 

616 GCTGGCCTTT 7234 

618 TGCTCACATG GCT CGAC AG A TCT ^ j J \ 72 57 

621 (2) INFORMATION FOR SEQ ID NO: 2: J r\^)c\\ P> IS/ 

623 (i) SEQUENCE CHARACTER ISTICS : f ^ / 1 

624 (A) LENGTH :<^7^a'iitiino acids/ " ' * 
6 25 (B) CYPE: am uc ii I J)^^ \ \ 



626 (D) TOPOLOGY: linear 

628 (ii) MOLECULE TYPE: protein 

630 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



632 


Met 


Ala 


His 


His 


His 


His 


His 


His 


Gly 


Tyr 


Pro 


Tyr 


Asp 


Val 


Pro 


Asp 


633 


1 








5 










10 










15 




635 


Tyr 


Ala 


Gin 


Ser 


Ser 


Ala 


Met 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


636 








20 










25 










30 






638 


Val 


Val 


Pro 


He 


Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


His 


Lys 


639 






35 










40 










45 








641 


Phe 


Ser 


Val 


Ser 


Gly 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


642 




50 










55 










60 










644 


Thr 


Leu 


Lys 


Phe 


He 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


645 


65 










70 










75 










80 


647 


Thr 


Leu 


Val 


Thr 


Thr 


Phe 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


648 










85 










90 










95 




650 


Pro 


Asp 


His 


Met 


Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


651 








100 










105 










110 






653 


Gly 


Tyr 


Val 


Gin 


Glu 


Arg 


Thr 


He 


Phe 


Phe 


Lys 


Asp 


Asp 


Gly 


Asn 


Tyr 


654 






115 










120 










125 








656 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 


Lys 


Phe 


Glu 


Gly 


Asp 


Thr 


Leu 


Val 


Asn 


Arg 


657 




130 










135 










140 










659 


He 


Glu 


Leu 


Lys 


Gly 


He 


Asp 


Phe 


Lys 


Glu 


Asp 


Gly 


Asn 


He 


Leu 


Gly 


660 


145 










150 










155 










160 
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E--> 



c cz a 


His 


Lys 


Leu 


Glu 


Tyr 


Asn 


Tyr 


Asn 


Ser 


His 


Asn 


Val 


Tyr 


"IT 7 

ne 


Met 


Ala 


a £ "3 










165 










1 7 A 

I / u 










1 7c 




ODD 


Asp 


Lys 


Gin 


Lys 


Asn 


Gly 


lie 


Lys 


Val 


Asn 


Phe 


Lys 


T 7 — 

lie 


Arg 


His 


Asn 


£ £ 
Dot) 








i on 
loU 










I O z> 










1 Q A 

iy u 






^ q 
boo 


Tl 

lie 


Glu 


Asp 


Gly 


Ser 


Val 


Gin 


Leu 


Ala 


Asp 


His 


Tyr 


Gin 


Gin 


Asn 


Thr 


boy 






i y d 










a a a 
Z U U 










7f\C 
Z U D 








0 / I 


Pro 


lie 


Gly 


Asp 


Gly 


Pro 


Val 


Leu 


Leu 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


C "7 A 




A 7 A 
^ IU 










A 7 K 

ZlD 










a a a 
ZZV 










O / 4 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


Lys 


Asp 


Pro 


Asn 


Glu 


Lys 


Arg 


Asp 


Hxs 


Met 


D / D 


a a c 










230 










a o c 

Z JO 










A A A 


cz "~i n 


val 


Leu 


Leu 


Glu 


Phe 


Val 


Thr 


Ala 


Ala 


Gly 


lie 


Thr 


His 


Gly 


Met 


Asp 


£7Q 

O / O 










245 










ZDV 










Zdd 




ooU 


Glu 


Leu 


Tyr 


Lys 


Gly Ala 


Gly 


Ala 


Gly 


Ala 


Gly 


Ala 


Gly 


Ala 


lie 


Ser 


b ol 








A £ A 










0 £ r 










A 1 A 

z 1 u 






COO 
DOJ 


TV 7 _ 

Ala 


Leu 


lie 


Leu 


Asp 


Ser 


Lys 


Glu 


Ser 


Leu 


•» 7 _ 

Ala 


Pro 


Pro 


Gly 


Arg 


Asp 


a o a 
b o 4 






A 7 C 
Z / D 










O O A 

ZoU 










A Q K 








r o /; 
OOO 


Glu 


Val 


Pro 


Gly 


Ser 


Leu 


Leu 


Gly 


Gin 


Gly 


Arg 


Gly 


Ser 


Val 


Met 


Asp 


£ Q 7 

bo/ 




a o a 

z y u 










a a K 

z y D 










"3 A A 










boy 


Phe 


Tyr 


Lys 


Ser 


Leu 


Arg 


Gly 


Gly 


TV 7 _ 

Ala 


Thr 


Val 


Lys 


val 


Ser 


TV 7 _ 

Ala 


Ser 


b y u 


jU J 










310 










J Id 










0 z u 


b y ^ 


Ser 


Pro 


Ser 


val 


Ala 


Ala 


Ala 


Ser 


Gin 


Ala 


Asp 


Ser 


Lys 


Gin 


Gin 


Arg 


b y o 










325 










O "3 A 










7 O c 
3 3D 




£ a c 


lie 


Leu 


Leu 


Asp 


Phe 


Ser 


Lys 


Gly 


Ser 


Thr 


Ser 


Asn 


Val 


Gin 


Gin 


Arg 


b y b 








j4U 










j4j 










OCA 






£T n o 

b y o 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


6 9 9 






3 55 










3 6 0 










365 








i a i 
/ UI 


Gin 


Gin 


Gin 


Pro 


Gly 


Leu 


Ser 


Lys 


Ala 


Val 


Ser 


Leu 


Ser 


Met 


Gly 


Leu 


7 n o 




o / U 










07c 










joU 










7 a /i 


Tyr 


Met 


Gly 


Glu 


Thr 


Glu 
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FOR 
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887 (i) SEQUENCE CHARACTERISTICS: 

888 (A) LENGTH: 60 base pairs 

889 (B) TYPE: nucleic acid 

890 (C) STRANDEDNESS : single 
891 
893 

8 96 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



(D) TOPOLOGY: linear AX^L ^ ^ 

(ii) MOLECULE TYPE: oligonucleotide K ^J^7^<-J-~+ ^ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: A~s 
E--> 898 TCGA££GCGC AAGAACACAG TGTTCTGACG ACACGAAGAA CAGGATGTTC 9 

8 99( TCGTACAGTG ~~~5Tp ' " §fl L-~ 

902 ( 2 f~I NFORMATToN~FOR SEQ ID NO: 7: 1 . ^4^^^ 



i QUEN EC] RACTER] SI 1 2S ^ 

905 (A) LENGTH: 60 base pairs 

906 (B) TYPE: nucleic acid 

907 (C) STRANDEDNESS: single 

908 (D) TOPOLOGY: linear 
910 (ii) MOLECULE TYPE: oligonucleotide 

913 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 

E--> 915 TCGACACTGT ACGAGAACAT CCTGTTCTTC GTGTCGTCAG AACACTGTGT /\ JIjmJ JLAAJ^ 

916 TCTTGCGCGC 60 

919 (2) INFORMATION FOR SEQ ID NO : 8: 

921 (i) SEQUENCE CHARACTERISTICS: 

922 (A) LENGTH: 60 base pairs 

923 (B) TYPE: nucleic acid 

924 (C) STRANDEDNESS: single 

925 (D) TOPOLOGY: linear 
*27 (ii) MOLECULE TYPE : oligonucleotide 

930 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

E--> 932 TCGAGCGCGC AAGGTCACAG TGACCTGACG ACACGAAGGT CAGGATGACC 

933 TCGTACAGTG 60 

93 6 (2) INFORMATION FOR SEQ ID NO : 9: 

938 (i) SEQUENCE CHARACTERISTICS 



939 (A) LENGTH: 60 base pairs iff 

fir 
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940 (B) TYPE: nucleic acid 

941 (C) STRANDEDNESS : single 

942 (D) TOPOLOGY: linear 
944 (ii) MOLECULE TYPE: DNA (genomic) 
947 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

E--> 949 TCGACACTGT ACGAGGTCAT CCTGACCTTC GTGTCGTCAG GTCACTGTGA 
950 CCTTGCGCGC 6 0 
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L:34 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 

L:35 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 

L:38 M:220 C: Keyword misspelled or invalid format, [(vii) PRIOR APPLICATION DATA:] 

L:59 M:220 C: Keyword misspelled or invalid format, [(D) TOPOLOGY:] 

L:61 M:246 W: Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:], SeqNo=l 

L:70 M:254 E: No. of Bases conflict, lnput:0 Counted:50 SEQ : 1 
M:254 Repeated in SeqNo=l 
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L:232 
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L: 247 
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M: 336 W: 
M: 336 W: 
M: 336 W: 
M: 336 W: 
M: 336 W: 
M : 3 3 6 W : 
M:332 E: 
Repeated 
M:342 E: 
M:246 W: 
M:246 W: 
M:246 W: 
M:246 W: 
M:254 E: 
M: 246 W: 
M:254 E: 
M: 246 W: 
M:254 E: 
M:254 E: 



VERIFICATION SUMMARY DATE: 03/25/2 002 

PATENT APPLICATION: US/10/001, 4 86 TIME: 14:48:41 

input Set : A:\W065457.txt 

Output Set: N:\CRF3\03252002\J001486.raw 

Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
Invalid Amino Acid Number in Coding Region, SEQ ID:1 
(32) Invalid/Missing Amino Acid Numbering, SEQ ID: 2 
in SeqNo^2 

Invalid Stop Code On Error, STOP CODON:* 
Invalid value of Alpha Sequence Header Field, 
Invalid value of Alpha Sequence Header Field, 
Invalid value of Alpha Sequence Header Field, 
Invalid value of Alpha Sequence Header Field, 
No. of Bases conflict, Input :0 Counted: 50 SEQ 
Invalid value of Alpha Sequence Header Field, 
No. of Bases conflict, Input :0 Counted : 50 SEQ : 7 
Invalid value of Alpha Sequence Header Field, [MOLECULE TYPE:] 
No. of Bases conflict, Input :0 Counted: 50 SEQ: 8 
No. of Bases conflict, Input :0 Counted: 50 SEQ: 9 



[MOLECULE TYPE: ] 
[MOLECULE TYPE: ] 
[MOLECULE TYPE: ] 
[MOLECULE TYPE: ] 
6 

[MOLECULE TYPE: ] , 



SeqNo=3 
SeqNo=4 
SeqNo=5 
SeqNo=6 

SeqNo=7 

SeqNo-8 



file://C:\CRF3\Outhold\VsrJ001486.htm 



3/25/02 



